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Abstract 

Social ( orfolksonomic) tagging has become a very popu- 
lar way to describe content within Web 2.0 websites. Unlike 
taxonomies, which overimpose a hierarchical categorisa- 
tion of content, folksonomies enable end-users to freely cre- 
ate and choose the categories (in this case, tags) that best 
describe some content. However, as tags are informally de- 
fined, continually changing, and ungoverned, social tagging 
has often been criticised for lowering, rather than increas- 
ing, the efficiency of searching, due to the number of syn- 
onyms, homonyms, polysemy, as well as the heterogeneity of 
users and the noise they introduce. To address this issue, a 
variety of approaches have been proposed that recommend 
users what tags to use, both when labelling and when look- 
ing for resources. As we illustrate in this paper, real world 
folksonomies are characterized by power law distributions 
of tags, over which commonly used similarity metrics, in- 
cluding the Jaccard coefficient and the cosine similarity, fail 
to compute. We thus propose a novel metric, specifically 
developed to capture similarity in large-scale folksonomies, 
that is based on a mutual reinforcement principle: that is, 
two tags are deemed similar if they have been associated to 
similar resources, and vice-versa two resources are deemed 
similar if they have been labelled by similar tags. We of- 
fer an efficient realisation of this similarity metric, and as- 
sess its quality experimentally, by comparing it against co- 
sine similarity, on three large-scale datasets, namely Bib- 
sonomy, MovieLens and CiteULike. 



1 Introduction 

The rise of Web 2.0 has transformed users from passive con- 
sumers to active producers of content. This has exponen- 
tially increased the amount of information that is available 
to users, from videos on sites like YouTube and MySpace, 
to pictures on Flickx, music on Last.fm, blogs on Blogger, 



and so on. This content is no longer categorised according 
to pre-defined taxonomies (or ontologies). Rather, a new 
trend called social (or folksonomic) tagging has emerged, 
and quickly become the most popular way to describe con- 
tent within Web 2.0 websites. Unlike taxonomies, which 
overimpose a hierarchical categorisation of content, folk- 
sonomies empower end users by enabling them to freely 
create and choose the tags that best describe a piece of infor- 
mation (a picture, a blog entry, a video clip, etc.). However, 
this freedom comes at a cost: since tags are informally de- 
fined, continually changing, and ungoverned, finding con- 
tent of interest has become a main challenge, because of the 
number of synonyms, homonyms, polysemy, as well as the 
inevitable heterogeneity of users and the noise they intro- 
duce. 

In order to assist users finding content of their own in- 
terest within this information abundance, new techniques, 
inspired by traditional recommender systems, have been 
developed: for example, whenever a user searches from 
some content using query tags {ti, . . . ,t„i}, new tags 
{tm+i, ■ ■ ■ , tm+n} are being added to the query, based on 
their similarity to their original query tags. This is done to 
increase the chances of finding content of relevance in these 
extremely sparse settings. Various metrics have been used 
to compute the similarity among folksonomy entities, in- 
cluding, for instance, cosine similarity, Jaccard coefficient, 
and Pearson Correlation. Performance results demonstrate 
an increase in accuracy and coverage of searches when us- 
ing these techniques; however, evaluation has been con- 
ducted on manipulated datasets so to obtain a much denser 
one. We argue that such manipulations alter the nature of 
real folksonomies, and indeed eliminate the problem, rather 
than solving it. 

Unmodified real-world folksonomies are characterized 
by two key properties: the power law distribution of tags, 
and the non-independence of data. Empirical studies Hl|5] 
illustrate that tag usage in folksonomies follows a power 
law distribution; this means that, if we were select any two 
tags, the probabiUty that the resources jointly labelled by 
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them is non-zero is extremely low. As a result, comput- 
ing tag similarity on un-modified folksonomies, using tra- 
ditional metrics like cosine similarity, would almost always 
yield close-to-zero values, thus failing to support users in 
retrieving resources relevant to their queries. Furthermore, 
metrics like cosine assume that tags are semantically inde- 
pendent of each other; once again, this assumption does not 
hold in real folksonomies, where tags may be synonyms to 
each other 

In this paper, we propose a novel similarity metric that 
can be used to accurately quantify tag similarity in large- 
scale real-world folksonomies (Section O. This similar- 
ity metric is computed following an iterative algorithm, 
grounded on a mutual reinforcement principle: that is, two 
tags are similar if they label similar resources, and vice- 
versa, two resources are similar if they have been labelled 
by similar tags. We describe an efficient realisation of this 
similarity metric (Section |4|i, and empirically quantify its 
quick convergence on three large-scale datasets, namely 
BibSonomjQ, MovieLenfl and CiteULik^ We measure 
Precision and Recall of our metric, and compare it to co- 
sine similarity on these unprocessed datasets (Section |5]l. 
Our findings demonstrate that, when considering our un- 
manipulated datasets, the performance of our novel similar- 
ity metric provides higher Precision and Recall w.rt. the 
cosine similarity. Section |6] covers related works on simi- 
larity measures, mainly applied to folksonomies. Finally, in 
Section|7]we draw our conclusions. 

2 Background 

In this section, we formally introduce some concepts that 
will be extensively used in the following, when presenting 
our approach. The first concept we consider is that of a 
folksonomy lfT2l : 

Definition 2.1 Let US ~ {ui, . . . , Un^} be a set of users, 
let RS — {ri , . . . , } be a set of resource URIs and let 
TS = {ti, . . . ,tnt} be a set of tags. A folksonomy F is a 
tuple F = {US, RS, TS, AS), where AS CUSxRSxTS 
is a ternary relationship called tag assignment set 

In this definition we do not make any assumption about 
the nature of resources; they could be URLs (like in De- 
licious), photos (as in Flickr), music files (as in Last.fm), 
documents (as in CiteULike), and so on. 

According to Definition 12. II a folksonomy F is a "three- 
dimensional" data structure whose "dimensions" are repre- 
sented by users, tags and resources. In particular, an el- 
ement a E AS is a triple {u,r,t), indicating that user u 

Ijhttp : / /www . bibsonomy . org/| 
"^http : //www .movielens . org/ 
Ihttp : //www ■ citeulike ■ org/| 



labelled resource r with tag t. To simplify modeling and 
management of folksonomies, their inherent tripartite graph 
structure is often mapped into three matrices, whereby each 
matrix models one relationship at a time (i.e., between tags 
and resources, tags and users, and resources and users) [ 19 |. 
In this paper, we adopt the same matrix-based representa- 
tion. Specifically, being rir, fit and ri„ the number of re- 
sources, tags and users respectively, we represent a folk- 
sonomy as the following three matrices: 

• TR (Tag-Resource): ant x rir matrix such that TR^j 
is the number of times the tag i labelled resource j; 

• TU (Tag-User): a rit x Uu matrix such that TUy is 
the number of times the tag i has been used by user j; 

• RU (Resource-User): a rir x matrix such that 
RUij is the number of times resource i has been la- 
belled by the user j. 

Tag similarity within a folksonomy can then be com- 
puted by looking at the resources these tags have been at- 
tached to. In particular, each tag ti can be mapped onto a 
vector tr{i) corresponding to the i-th row of TR. Given 
an arbitrary pair of tags ti and tj, their similarity s{ti,tj) 
can be computed as the cosine similarity (CS) of the vec- 
tors tr{i) and tr{j): 

" " V(t.«,t.«)V(t.(j),t.(j)) 

being (•, •) the usual inner product in R""^. 

Cosine similarity has been successfully applied in the 
context of Information Retrieval liTSI . Within a folksonomy. 
Equation [T] states that the similarity score of a pair of tags 
is high if they jointly co-occur in labelling the same subset 
of resources. However, two key properties of folksonomies, 
that are, ( i) the power law distribution of tags and ( //) their 
non-independence, cause Equation[T]to yield very poor re- 
sults in this domain, as we shall discuss next. 

Power Law in Tag Distribution. Let us consider a real- 
world folksonomy like BibSonomy. BibSonomy fTTlfTsl is 
a social bookmarking service in which users are allowed to 
tag both URLs and scientific papers. A power law distribu- 
tion of tags on scientific references emerges. In particular, 
resources were described by no more than 5 different tags 
(roughly 81%), and usually less than 3 (roughly 58%). A 
small portion of frequently adopted tags used to bookmark 
scientific references, and a long tail of tags (roughly 81%) 
being used less than 5 times. 

Following the above observations, matrix TR is rather 
sparse; thus, if we were to select any pair of tags ti and 
tj, most of the components of the corresponding vectors 
tr{i) and tr{j) would be and, therefore their inner product 
would be close to 0. The cosine similarity between any ti 



and tj would therefore be almost 0, regardless of the initial 
choice of ti and tj . Such counter-intuitive result is an effect 
of the inadequacy of cosine similarity to capture properties 
of tags in large-scale real foUcsonomies. 

Non-Independence of Tags. Cosine similarity implic- 
itly assumes that the components of the vectors appearing in 
Equation[T]are independent of each other. Such an assump- 
tion does not often hold true. For instance, consider a folk- 
sonomy consisting of two resources ri and r2, representing 
two different scientific papers, both discussing about folk- 
sonomies. Suppose that the paper associated with r2 is an 
extension of the paper associated with ri . Finally, assume 
to bookmark the resource ri with the tag ti = "folkson- 
omy" and to bookmark the resource r2 with the tag t2 = 
"social tagging". In this case, the similarity between ti and 
t2 computed according to Equation[T]would be 0, even if ti 
and t2 should result similar each other. The mutual similar- 
ity between ti and t2 can be assessed only if we consider 
the non-independence of the resources they label. 



3 Approach Description 

In this section, we present a new definition of tag (and re- 
source) similarity, that is particularly suited to quantify sim- 
ilarity of elements (be them tags of resources) in datasets 
characterized by power law distribution and non-indepen- 
dence of data. Our definition of similarity relies on the mu- 
tual reinforcement principle: 

Two tags are similar if they label similar re- 
sources, and conversely, two resources are sim- 
ilar if they are labelled by similar tags. 

In the following, we shall derive a mathematical formula 
to compute tag and resource similarity on the basis of the 
principle stated above. After this, we shall illustrate why 
our formula is able to effectively address the power law and 
non-independence challenges. 

We designed an iterative algorithm to compute the sim- 
ilarity score. In the base case, given a pair of tags {ta, tb) 
and a pair of resources (r^, ri,), we define the tag similarity 
st'^(ta,tb) and the resources similarity sr'^(ra,rb) as fol- 
lows: 

st"{ta,tb) = 5ab sr"{ra,rb) = Sab (2) 

being Sab the Kronecker symbol Equation |2] reflects the 
fact that, in the initial step, each tag (resp., resource) is sim- 
ilar only to itself and it is dissimilar to all other tags (resp., 
resources). 

At the fc-th step, let st''^^{ta,tb) (resp., sr''^^{ra,rb)) 
be the tag (resp., resource) similarity between the tags ta 



and tb (resp., resources and rb). We apply the following 
rules to update st''~^{ta,tb) (resp., sr'^~^(ra, rb))'. 



St^{ta,tb) = 

sr''{ra,rb) = ■ 



ST'^ita.tb) 



^ST^{ta,ta)*^ST^{tb,tb) 

SRHra,rb) 

) * ^/ SR^{rb,rb) 



(3) 
(4) 



where: 



*We recall that the Kronecker symbol S^b is equal to 1 if a and b coin- 
cide and otherwise. 



ST\ta,tb) = ™« * * sr'=-i(r,;, r,) * TKbj (5) 

SR^{ra,rb) = TR,:a * % * st'^-^t^.t,) * TR.fc (6) 

Here 5'^ is equal to 1 if i = j and it is equal Xo ijj if i ^ j, 
where tp (called propagation factor) is a value belonging to 
the interval [0, 1] G M. 

Equations [3]B1 rely on the following intuitions. Given a 
pair of tags {ta , tb), at the k iteration, we consider all pair of 
resources {ri,rj) in our folksonomy and we take their sim- 
ilarity sr'^~^{ri,rj) into account to compute st''{ta, tb). In 
particular, we compute a weighted sum of all the similarity 
values sr^~'^{ri, rj), where the weights reflect the strength 
of the association between the tag ta and the resource r^, 
and the tag tb and the resource rj . As a consequence, the 
higher the similarity between and rj , the higher the con- 
tribution of the association between the tag ta and the re- 
source Ti, and the tag tb and the resource rj. Finally, the 
term is instrumental to give higher relevance to tags 
that labelled the very same resources, w.rt. the fact that 
they labelled two similar (but different) resources. 

Note that, in the special case in which ip = 0, our 
method does not depend on k and Equations |3]-E]reduce to 
the cosine similarity formulation. In fact, in this particular 
case, all the contributions sr^~^{ri^rj) and st^~^{ri,rj) 
are disregarded when i ^ j, and are taken into considera- 
tion only when i = j. Since all contributions sr^~^{ri^ ri) 
and st^^^{ri,ri) are equal to 1 by definition, it follows that 
Equations [3]-Slreduce to the cosine similarity formulation. 

Equations [3]-Sl are able to effectively address the power 
law and non-independence of data challenges we outlined 
above. In fact: 

• In the computation of tag (resp., resource) similarity, 
we leverage on the similarity of all pairs of resource 
(resp., tag) similarities. As a consequence, unlike co- 
sine similarity, we do not restrict ourselves to consider 
only the resources jointly labelled by two tags (resp., 
the tags jointly labelling two resources), which can be 
few, but we iteratively propagate similarity scores by 
considering all the pairs of similar resources jointly la- 
belled by the two tags (resp., all the pairs of similar 



tags jointly labelling two resources). In this way we 
are able to face the power law occurring in tag usage. 

In our definition of similarity, if two tags label sim- 
ilar, even if not coincident, resources their similarity 
score will be greater than 0, whereas the cosine sim- 
ilarity would return 0. As a consequence, our simi- 
larity method takes into account forms of correlation 
among pairs of resources and/or tags rather than as- 
suming their independence. 



4 Realization 

From a computational standpoint. Equations [3]-2] could en- 
tail a large overhead for two reasons: 

• From a theoretical standpoint, our approach may need 
an infinite number of iterations. As a consequence, we 
need a stopping criterion allowing us to safely termi- 
nate the execution of Equations [3}S1 after a finite (and 
low) number of iterations. 

• Equation |3] (resp.. Equation |4| requires the computa- 
tion of ri^ resource-resource (resp,. n1 tag-tag) simi- 
larities, at each fc-th step. This could make our simi- 
larity measure inapplicable in practical cases, because 
each iteration requires exactly x n1 computations. 



where: 



Fortunately, there are two important results making our 
similarity measure applicable and entailing the same com- 
plexity level as cosine similarity. The first result can be 
stated by the theorem showed and proved in the Appendijfl 
which affirms that the sequences st*'{ta,tb) and sr'^(ra, r^) 
defined as in Equations [3]-H] converge. 

This theorem ensures that, after a certain number of it- 
erations. Equations [3]-0] converge to stable values. During 
experimentation conducted on three real folksonomies (see 
Section |5Tt , we empirically found that convergence was 
achieved after as little as five iterations, thus suggesting that 
our similarity measure is applicable in practical cases. 

Furthermore, Equations [3]-0] can be defined, without any 
loss of generality, as a simple matrix product (such as in 
cosine similarity). Specifically, let st*^ and sr'^ be the tag- 
tag and resource-resource similarity matrices respectively, 
with st'' = It and sr° — 1^; here st° = It (resp., sr*^ = 1^) 
is the rit x rit (resp., x n^) identity matrix. If we indicate 
with the symbol "o" the Hadamard matrix product |7 1, at 
the fc-th step, the st*^ and sr''' matrices can be computed as: 



St" = ST'^ o DT'^ 



(7) 
(8) 



'Seelhttp : / /tinyurl . com/proof-seke20 1 1| 

^Given two matrices A and B of the same dimensions, tlie Hadamard 

product A o B is a matrix of the same dimensions of A and B and it is 

defined as follows: (A o B)^^ = A^j ■ B^^- 



grpfc = xR X (*r o sr^-^) X TR* 
SR*-' = TR* X (*t o st'=-i) X TR 
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(9) 
(10) 
(11) 
(12) 



In the above equations, we have indicated with 
(resp., 'S'f ) a square matrix rir x rir (resp., rit x "-*) where all 
the elements are set to ijj, with the exception of the diagonal 
where the elements are set to 1 ; the symbol TR* represents 
the transpose of matrix TR. We have thus reduced the com- 
putational complexity of each iterative step from x to 
a simple matrix product; this reduction, coupled with the 
empirical observation that 5 iterative steps are sufficient to 
find convergence, makes our similarity metrics suitable in 
practical contexts. The last question that needs answering 
is how effective (in terms of Precision and Recall) our simi- 
larity metric is w.rt. traditional ones like cosine. We answer 
this question next. 



5 Experiments 

In order to evaluate the performance of our similarity mea- 
sure, we built a prototype in Java and MySQL and we con- 
ducted experiments using three well known social tagging 
websites: Bibsonomy, CiteULike, and MovieLens. The ex- 
periments we carried out aimed to answer the following 
question: 

If we consider any two tags U and tj belonging to 
a folksonomy, is our similarity measure capable 
of accurately assessing the extent to which they 
are related (similar) each other? And can it do 
so even when such tags have been drawn from the 
long tail of low popularity tags? 

5.1 The Dataset 

To answer the above question, we conducted experiments 
on the following three datasets. 

Bibsonomy. Bibsonomy is a social bookmarking web- 
site promoting the sharing of both scientific reference and 
general URL. We downloaded a snapshot of the website in 
June 2009, containing bookmarks made between January 
1999 and June 2009. 

CiteULike. CiteULike is a social bookmarking website 
that aims to promote and develop the sharing of scientific 
references amongst researchers. CiteULike enables scien- 
tists to organize their libraries with freely chosen tags which 
produce a folksonomy of academic interests. CiteULike 



Dataset 


Users 


Resources 


Tags 


Bookmarks 


Bibsonomy 


4,696 


578,587 


147,076 


648,924 


CiteULike 


57.053 


1,928,302 


401,620 


2,281,609 


MovieLens 


4,009 


7,601 


15,240 


55,484 



Table 1 : Features of our datasets 



runs a daily process which produces a snapshot summary of 
what articles have been posted by whom and with what tags 
up to that day. We downloaded one such archive in Novem- 
ber 2009, containing bookmarks made between November 
2004 to November 2009. 

MovieLens. MovieLens is a rate-based recommenda- 
tion website that suggests to users movies they might like. 
We downloaded such dataset in January 2009, containing 
bookmarks made from December 2005 to January 2009. 

Table[T]summarizes the features of the involved datasets. 

5.2 Simulation Setup 

Our experimental investigation aimed to quantify, in each 
of the above datasets, the extent to which our similarity 
measure was capable of identifying related tags, especially 
when tags were drawn from the long tail. To investigate 
this, for each dataset of Table [T] has been used as follows. 
We split it into two different sets, called test set and train 
set. Each train set was composed of 90% random book- 
marks taken from the involved dataset; we used these book- 
marks for training purposes. Test sets contained the remain- 
ing 10% of bookmarks which were used for testing. Each 
bookmark in a test set has then been used as a query; specif- 
ically, if the number of tags in such bookmark was large 
enough, then these were split into two different sets - if 
possible of the same size - called tSetg (query tag set) and 
tSetE {expected tag set). In our experiments, a bookmark 
was considered large enough if it had at least 3 tags associ- 
ated. Tags composing tSetq were used to query the train 
set; in particular, we selected from the train set the k tags 
most similar to tags belonging to tSetg, according to two 
metrics: the one we proposed in Section[3] and cosine sim- 
ilarity, which we used as benchmark. We denote this set as 
tSetji (result tag set). The value of k was chosen equal to 
the size of the expected set in such a way that tSetj^ and 
tSetE had the same size. Finally, we compared tSetn with 
tSetE- the higher the overlap between tSetn and tSetE, 
the more effective the similarity measure in identifying re- 
lated tags. This follows the intuition that, if a user associ- 
ated a set of tags to a certain resource, such tags are related 
to each other (that is, tSet e contains tags related to those 
contained in tSetq). 

To quantitatively evaluate our similarity measure, we 
computed two metrics commonly used in Information Re- 



Propagation 


Bibsonomy 


CiteULike 


MovieLens 


ip = Q 


0.100638896 


0.057922233 


0.075126961 


= 0.15 


0.128318833 


0.063290603 


0.112358995 


i(! = 0.3 


0.139761842 


0.070652236 


0.115026291 


■ip = 0.6 


0.140748308 


0.079320913 


0.115534133 



Table 2: Precision values in our datasets 



Propagation 


Bibsonomy 


CiteULike 


MovieLens 


ip = Q 


0.100625714 


0.057864697 


0.075143054 


ip = 0.15 


0.128373044 


0.063273342 


0.110927464 


ij = 0.3 


0.139939546 


0.070634975 


0.119373901 


ip = 0.6 


0.140429576 


0.079303652 


0.119949092 



Table 3: Recall values in our datasets 



trieval, namely Precision and Recall |[T|: 

Prec.s^on = ^^^^^ (13) 
Recall = (14) 

We computed Precision and Recall values for each test 
bookmark; we repeated this process 10 times over different 
train and test splits of the datasets. The results we present 
next are averages of such runs. 

5.3 Results 

Tables |2] and [3] shows values of Precision and Recall we 
obtained by applying our similarity measure on the datasets 
of Table [T] for different values of tp (see Equations [5]^. 
The benchmark is our similarity measure with ijj — 0, that 
is, the case in which our similarity measure reduces into 
cosine similarity. 

From the analysis of Tables |2] and |3] we can draw the 
following main observation: in large scale folksonomies, 
classical approaches - such as cosine similarity (?/; = 0) - 
have difficulties finding similarity relationships among the 
tags belonging to the long tail, as their Precision and Re- 
call is lower than those achieved with our iterative approach 
for any value of V'. The considered datasets are character- 
ized by a very long and prominent tail of low popularity 
tags; in these real cases, out iterative measure of similarity 
produces Precision/Recall that is approximately 40% better 
than cosine similarity for BibSonomy and CiteULike, and 
approximately 50% better for MovieLens. 

6 Related Work 

In the last few years, folksonomies have been the subject 
of extensive research. An interesting survey on the charac- 
teristics of folksonomies can be found in |4|. One of the 
first investigations into the characteristics of folksonomies 



has been presented by Mathes fTSl : in that work, the author 
discusses advantages (e.g., simpHcity of use) and disadvan- 
tages (e.g., ambiguity, synonyms) of folksonomies, and in- 
vestigates the community aspects behind folksonomies, on 
two scenarios, FHckjQ and DeUcioufl 

Despite their easy-of-use, the lack of structure that char- 
acterises folksonomies makes it difficult to browse and find 
relevant content. To tackle this issue, the research com- 
munity has been actively researching techniques to support 
information retrieval. Approaches in this area have fol- 
lowed one of two streams: they have either tried to empir- 
ically derive an ontology from the underlying foUcsonomy, 
or they have tried to apply graph-exploration techniques on 
the foUcsonomy itself. 

Lambiotte |,14J and Mika tl9i , for example, were the 
first to extend the classic bipartite model of tag-resource 
towards a tripartite model, which takes into account both 
users (as actors), tags (as concepts) and resources (as in- 
stances); they showed that, by applying this model to De- 
licious, a lightweight ontology could be extracted from the 
underlying folksonomy. Similarly, |9| used similarity met- 
rics to reconstruct a concept hierarchy. 

Hotho et al. |[T2l l20l followed a different approach in- 
stead: they presented a formal model, which converts a 
folksonomy into an undirected weighted graph, and cou- 
pled it with a new search algorithm, namely "FolkRank", 
based on the well-known seminal "PageRank" [2|. They 
applied this algorithm to Delicious, and showed how it can 
be used as a tag recommender system. Other extensions of 
recommender systems to folksonomy structures have been 
explored 11211 [TOj : some of these have been assessed against 
one of the datasets we adopted in this study, namely Bib- 
Sonomy HUini. 

All the above approaches rely on a similarity measure 
to quantify tag relatedness. Measures which have been of- 
ten used in the literature include the Jaccard coefficient fSl, 
the cosine similarity [6J, and a number of improvements 
over it llT5ll22l . Liu et al. lITSl dwelt further into the prob- 
lem of computing similarities in folksonomies; in particu- 
lar, they questioned the common assumption that text cat- 
egorization can be mapped onto orthogonal spaces, due to 
problems of synonyms and ambiguities (as already figured 
out by 1.18.1 ). They then devised an improved similarity met- 
ric ("SNOS", Similarity equations in the Non-Orthogonal 
Space) which is optimized for comparing objects mapped 
onto non-orthogonal spaces, considering a principle of "mu- 
tual reinforcement" from which we drew inspiration in this 
work. They proved the convergence of this technique and 
experimentally investigated the performance of SNOS on 
synthetic datasets, such as the formerly called MSN search 



|http : //www ■ f lickr . com/| 
'http : //www . delicious . com/ 



engine (now, Bin^. Their novel metric was shown to out- 
perform the classic cosine similarity, if applied to the con- 
text of finding similar queries. Some of their findings are 
here extended to the domain of folksonomies. 

Similarity measures have often been evaluated on differ- 
ent datasets, making it difficult to assess their relative ad- 
vantages and disadvantage in different domains. Further- 
more, they have often been applied to manipulated datasets, 
making the comparison even more difficult. Indeed, in or- 
der to critically compare them, an evaluation framework 
has recently been proposed fiT], with the aim of provid- 
ing support to systematically compare several tag similar- 
ity measures, using data from Delicious 13 1. This work 
contributes to the assessment of the suitability of similarity 
measures to scenarios characterized by power-law distribu- 
tion of tags and non-independence of data, showing how 
traditional measures like cosine do not work, and proposing 
an alternative, iterative measure that provides good accu- 
racy instead. 

7 Conclusions 

In this paper, we have shown that real world folksonomies 
are characterized by power law distributions of tags and 
non-independence of data. Under these conditions, tra- 
ditional similarity measures like cosine similarity fail to 
capture tags relatedness. To remedy this, we have proposed 
a novel metric, specifically developed to capture similarity 
in large-scale folksonomies, that is based on the mutual 
reinforcement principle: that is, two tags are deemed 
similar if they have been associated to similar resources, 
and vice-versa two resources are deemed similar if they 
have been labelled by similar tags. We have described an 
efficient realisation of this similarity metric, and assessed 
its quality experimentally, by comparing it against cosine 
similarity, on three large-scale datasets, namely Bibson- 
omy, MovieLens and CiteULike. 
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